TUTA1 at the NTCIR-10 Intent Task

نویسندگان

  • Haitao Yu
  • Fuji Ren
چکیده

In NTCIR-10, we participated in the subtask of Subtopic Mining. We classify test topics into two types: role-explicit topic and roleimplicit topic. According to the topic type, we devise different approaches to perform subtopic mining. Specifically, for roleexplicit topics, we propose an approach of modifier graph based subtopic mining. The key idea is that: The modifier graph corresponding to a role-explicit topic is decomposable into clusters with strong intra-cluster interaction and relatively weak inter-cluster interaction. Each modifier cluster intuitively reveals a possible subtopic. For role-implicit topics that generally express single information needs, we directly generate the ranked list through semantic similarities leveraging on lexical ontologies. The evaluation results show that our best Chinese subtopic mining run gets the first position among all the runs in terms of # D nDCG . However, our English subtopic mining runs show a poor performance, which is planned to be further improved in our future work.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

TUTA1 at the NTCIR-12 Temporalia Task

Our group submitted task for Temporal Intent Disambiguation (TID) Subtask (Chinese) of NTCIR-2012. We using word2vec to model query String into feature vector, and using cos function to measure the similarity between query string and training corpus SougouCA. Our results shows the approach is efficient for solving thoes Task.

متن کامل

TUTA1 at the NTCIR-11 Temporalia Task

This paper details our participation in the NTCIR-11 Temporalia task including Temporal Query Intent Classification (TQIC) and Temporal Information Retrieval (TIR). In the TQIC subtask, we explore the rich temporal information in the labeled and unlabeled search queries. Semi-supervised and supervised linear classifiers are learned to predict the temporal classes for each search query. In the T...

متن کامل

TUTA1 at the NTCIR-11 IMine Task

In this paper, we detail our participation in two subtasks: subtopic mining and document ranking of the NTCIR-11 IMine task. In the subtopic mining subtask, to discover the latent hierarchy among query-like strings, our key idea is to structurally parse query-like strings by characterizing pairwise dependency in the bag-of-units perspective. Then the clustering algorithm (i.e., affinity propaga...

متن کامل

RMIT and Gunma University at NTCIR-9 Intent Task

In this report, we describe our experimental results for the NTCIR-9 intent task. For our experiments, we use our experimental search engine, Newt. Newt is a ranked selfindex capable of supporting multiple languages by deferring linguistic decisions until query time. To our knowledge, this is the first Information Retrieval task on the ClueWeb09-JA collection performed entirely with ranked self...

متن کامل

KYOTO at the NTCIR-12 Temporalia Task: Machine Learning Approach for Temporal Intent Disambiguation Subtask

This paper describes the Kyoto system for Temporal Intent Disambiguation (TID) subtask in the NTCIR-12 Temporal Information Access (Temporalia-2) challenge. The task is to estimate the distribution of temporal intents (Past, Recency, Future, Atemporal) of a given query. We took a supervised machine learning approach, using features of bag of words, POS and word vectors. We also incorporated kno...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013